# Multi-domain Pretraining

**Tinyllama V1.1 Chinese** · Apache-2.0 · by TinyLlama
TinyLlama is a 1.1-billion-parameter small language model that adopts the same architecture and tokenizer as Llama 2, making it suitable for resource-constrained applications.
Tags: Large Language Model, Transformers, English

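Because TinyLlama reuses the Llama 2 architecture and tokenizer, it can be loaded with the standard causal-LM classes of the Hugging Face Transformers library. The sketch below is illustrative only; the repository id is an assumption and should be replaced with the actual checkpoint name. The same pattern applies to the other causal checkpoints on this page, such as ruGPT-3.5.

```python
# Minimal sketch: loading a Llama-2-architecture checkpoint with Transformers.
# The repository id below is an assumption, not a confirmed checkpoint path.
from transformers import AutoModelForCausalLM, AutoTokenizer

model_id = "TinyLlama/TinyLlama_v1.1_Chinese"  # assumed repo id

tokenizer = AutoTokenizer.from_pretrained(model_id)
model = AutoModelForCausalLM.from_pretrained(model_id)

# Generate a short continuation for a Chinese prompt.
inputs = tokenizer("你好，请介绍一下你自己。", return_tensors="pt")
outputs = model.generate(**inputs, max_new_tokens=50)
print(tokenizer.decode(outputs[0], skip_special_tokens=True))
```
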
**Rugpt 3.5 13B** · MIT · by ai-forever
A 13-billion-parameter Russian language model pretrained on 300 GB of multi-domain data, reaching a perplexity of about 8.8 on Russian text.
Tags: Large Language Model, Transformers, Multilingual

**Distilbert Mlm Best** · by vocab-transformers
DistilBERT is a lightweight distilled version of BERT, retaining 97% of BERT's performance while being 40% smaller and 60% faster.
Tags: Large Language Model, Transformers

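A DistilBERT checkpoint trained with a masked-language-modeling objective can be exercised through the Transformers fill-mask pipeline. The sketch below uses the publicly available `distilbert-base-uncased` checkpoint as a stand-in; the exact id of the "Distilbert Mlm Best" checkpoint listed here is not confirmed and would need to be substituted.

```python
# Minimal sketch: querying a DistilBERT masked-LM with the fill-mask pipeline.
# "distilbert-base-uncased" is a stand-in; swap in the listed checkpoint's id.
from transformers import pipeline

fill_mask = pipeline("fill-mask", model="distilbert-base-uncased")

# Print the top predictions for the masked token with their scores.
for prediction in fill_mask("The capital of France is [MASK]."):
    print(prediction["token_str"], round(prediction["score"], 3))
```
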
**Procbert** · by fbaigt
ProcBERT is a language model specifically optimized for procedural text, pretrained on a large-scale corpus of procedural documents (including biomedical literature, chemical patents, and cooking recipes) and showing strong performance on downstream tasks.
Tags: Large Language Model, Transformers, English

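As a BERT-family encoder, ProcBERT is typically fine-tuned for downstream tasks such as tagging entities in procedural text. The sketch below shows one way to attach a token-classification head with Transformers; the repository id and the label count are assumptions for illustration only.

```python
# Minimal sketch: preparing a ProcBERT-style encoder for token classification.
# The repo id and num_labels are assumptions; a new classification head is
# initialized here and would need fine-tuning before it is useful.
from transformers import AutoTokenizer, AutoModelForTokenClassification

model_id = "fbaigt/procbert"  # assumed repo id

tokenizer = AutoTokenizer.from_pretrained(model_id)
model = AutoModelForTokenClassification.from_pretrained(model_id, num_labels=5)  # assumed label set

inputs = tokenizer("Add 5 mL of buffer and incubate at 37 °C for 30 minutes.", return_tensors="pt")
logits = model(**inputs).logits  # per-token label scores
print(logits.shape)
```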